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ABSTRACT 

Noting that the United States Army Research Institute 
for the Behavioral and Social Sciences (ARI ) has been, involved in 
research on reading assessment in the Army from both practical and 
theoretical perspectives, this paper addresses practical Army 
problems in reading assessment from a theory base that reflects the 
most recent and most sound research on reading comprehension. The 
first two sections of the paper, which concern the importance of 
reading in work life and the importance of reading assessment in the 
Army, provide the basis for addressing the overall issue of reading 
assessment in the Army. The reading theories presented in the third 
section are helpful in generating evaluative criteria for reviewing 
potential measures of reading for Army use. In the fourth and fifth 
sections, three categories of measures are considered: civilian, 
military, and the special case of the Armed Services Vocational 
Aptitude Battery (ASVAB) — a military test of general cognitive 
ability with potential value as a surrogate rending measure. The 
sixth section deals with scoring issues that should be addressed 
regardless of which measure is used. The final section offers 
conclusions based on both theory and practice: (1) reading is 
important in military and civilian work life; (2) reading assessment 
is a highly visible and important issue in the Army; (3) reading 
theories — especially the interactive-inferential theory — can 
positively influence Army reading measurement practices; (4) reading 
tests differ widely in terms of psychometric characteristics and 
overall quality as evaluated using theory-based standards; (5) high 
correlations exist between ASVAB and various reading tests, although 
caution should be exercised when using any part of the ASVAB as a 
reading surrogate; and (6) available alternative grade equivalent 
scores s? «>uld be considered for use by the Army. (A chart providing 
data and quality ratings on commercially developed reading tests and 
selected military developed reading tests is appended.) (HTH) 
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FOREWORD 



The Manpower and Personnel Research Laboratory of the U.S. Army Research 
Institute for the Behavioral and Social Sciences (ARI) conducts research in 
areac related to manpower and personnel issues. One key issue is reading 
performance of Army soldiers as shown through reading assessment. Reading 
assessment has become an important problem not just for training but also for 
personnel selection and classification. This report addresses Army reading 
assessment from a theory base that is both current and useful. The research 
was conducted under ARI Project Number 2Q263731A791. The information reported 
here will be helpful to policymakers and scientists concerned with military 
selection, classification, training, and education. 




EDGAR M. JOHNSON 
Technical Director 
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A THEORY-BASED APPROACH TO READING ASSESSMENT IN THE ARMY 



EXECUTIVE SUMMARY 



Requirement: 

To address practical Army problems in reading assessment from a theory 
base that reflects the most recent and most sound research on reading 
comprehension. 



Procedure: 

Specific aspects of the Army reading assessment problem were identified. 
The most current and relevant reading theory and its implications for reading 
assessment in general were then explained. Finally, the relationship of those 
implications to Army reading assessment was described. 



Findings: 

Six major conclusions are drawn from both theory and practice. First, 
reading is important in military and civilian work life. Second, reading 
assessment is a highly visible and important issue in the Army. Third, 
reading theories, especially the new interactive- inferential theory, can 
positively influence reading measurement practices in the Army. Fourth, 
reading tests differ widely in terms of psychometric characteristics and 
overall quality as evaluated by theory-based standards. Fifth, the Armed 
Services Vocational Aptitude Battery (ASVAB) and various reading tests have a 
high correlation, although caution needs to be exercised in substituting any 
part of the ASVAB for a reading test. Sixth, alternatives to grade equivalent 
scores are available and should be considered for use in the Army. 



Utilization of Findings: 

This report has utility for Army selection, classification, training, and 
education. The findings may also be helpful to other military services con- 
cerned with reading assessment issues. 
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A THEORY-BASED APPROACH 
TO READING ASSESSMENT IN THE ARMY 



The U.S. Army Research Institute for the Behavioral and Social Sciences 
(ARI) has been involved in research on reading assessment in the Army from 
both practical and theoretical perspectives* The purpose of this paper is to 
address practical Army problems in reading assessment from a theory base that 
reflects the most recent and most sound research on reading comprehension. 
This paper has seven sections. The first two sections, which concern the im- 
portance of reading in work life and the importance of reading assessment in 
the Array, provide the basis for addressing the overall issue of reading as- 
sessment in the Army. The reading theories presented in the third section arc 
helpful in generating evaluative criteria for reviewing potential measures of 
reading for Army use. in the fourth and fifth sections, three categories of 
measures are considered: civilian, military, and the special case of the 
Armed Services Vocational Aptitude Battery (ASVAB) — a military test of general 
cognitive ability with potential value as a surrogate reading measure. The 
sixth section deals with scoring issues that need to be addressed no matter 
what measure is used. The final section offers conclusions based on both 
theory and practice. 



READING IS IMPORTANT IN MILITARY AND CIVILIAN WORK LIFE 

Military research on reading has shown that reading is an inportant part 
of military life. Sacher and Duffy (1978) found that military workers 1 over- 
all job performance suffered if they read more than two grade levels below job 
demands. Sticht (1980) observed a low generalizability of general reading 
skills to specific literacy tasks in the military and emphasized the need for 
job-related reading tests. Sticht (1982) also investigated empirical rela- 
tions between reading proficiency and job proficiency in the military setting 
and found that the armeJ services pose demands for basic skills equal to or 
greater than those of civilian jobs. Even with higher enlistment standards, 
many military recruits are below average in basic skills, including not only 
reading but also other skills. However, Sticht (1982) found that basic skills 
competence as measured by a variety of tests did not appear to be the over- 
riding determinant of success in the military. The precise relationship be- 
tween reading skills and job skills needs further investigation. 

in addition to military research, civilian research may have some im- 
plications for Army literacy. Diehl and Mikulecky (1980) observed 100 workers 
representing a cross-section of occupations and found that 90% of the workers 
participated in some form of reading each day. in a later study, Mikulecky 
(1982) compared high school reading to work reading and found that students 
read less for school than most workers did for work. Workers read more often 
for application or to make judgments, while students read to gather facts. 
Mikulecky and Strange (in preparation) noted that the workplace calls for a 
wider range of literacy strategies than does school literacy, 95% of which is 
based on textbooks. These civilian research results may, by indication, 
underscore the in$>ortance of reading in many, if not all, military jobs. 
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READING ASSESSMENT IS IMPORTANT IN THE ARMY 



in the last few years reading assessment has become a key issue in the 
Army primarily because of changes in the reading ability levels of the Army 
population. Population shifts are, in turn, related to a multitude of fac- 
tors, such as changes in the national economic picture and variations in 
norming of the ASVAB used for Army selection and classification, in recent 
years the Training and Doctrine Command (TRADOC), the Forces Command (PORSCOM), 
the Education Directorate of The Adjutant General 1 s Office (TAGO), and the 
Office of the Deputy chief of Staff for Personnel (ODCSPER) have all turned to 
ARI for practical advice on reading assessment. For example, both TRADOC and 
FORSCOM have asked for ARI f s help in measuring the reading level of noncom- 
missioned officers (NCOs). TRADOC Systems Analysis Activity (TRASANA) has 
requested assistance in interpreting its data on reading tests of officers and 
enlisted personnel. TRADOC is currently establishing a program to improve 
officers 1 communication skills and has asked for ARI's advice on the use of 
reading tests as part of the program. TAGO is funding an ARI project, known 
as the Job Skills Education Program (JSEP), a computer-based effort that in- 
volves the assessment of job-oriented basic skills. ARI has also advised 
ODCSPER on the difficulties of using reading as a reenlistment criterion for 
midterm noncommissioned officers. 

Army agencies often ask the following questions about reading assess- 
ment: 'What reading tests can we use?* "What do our reading test scores 
mean?" and "Since ASVAB scores relate to reading test scores, can the ASVAB be 
used as a substitute for a reading measure!" Recent requests for assistance 
have not focused on the more central question, "How does reading proficiency 
relate to job proficiency?" The last question— perhaps the most crucial ques- 
tion for Army reading assescment— is an empirical one that research has not 
yet answered adequately. However, both theory and research provide answers to 
the first three questions. Before addressing these questions in detail, we 
will summarize several key types of reading theories and indicate their impli- 
cations for reading assessment. 



THREE TYPES OF READING THEORIES ARE EXAMINED 

Many theories of reading exist. One theory unites perceptual and cog- 
nitive elements, a second set of theories relates to information processing. 
A third theory, the most recent, concerns inferencing in an interactive mode. 
We will describe the essence of these theories and their implications for 
reading assessment. The greatest emphasis is on the last theory because of 
its major implications for reading assessment. 

Perceptual-Cognitive Theory involves Strategies 

Gibson and Levin (1975) presented a theory that is both perceptual and 
cognitive. Although they stressed most heavily the perceptual underpinnings 
of the reading process, these researchers also indicated that psychological 
processes of mature reading go far beyond perception to remembering problem 
solving, and organization of conceptual knowledge for better extraction of 
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meaning* Two implications of perceptual-cognitive theory for reading assess- 
ment are (1) the cognitive strategies that help people improve their reading 
can also be used to help people perform better on reading tests, and (2) these 
strategies are not innate but can be taught. 

information Processing Theories Analyze Reading Components 

As noted by Carpenter and Just (in preparation), information processing 
is characterized by efforts to understand what information is represented in 
memory, what information is acquired, how processes are acquired and invoked, 
how long these processes take, and what sources of error exist. A major con- 
tribution of information processing theory is that it acknowledges the role of 
environment (not just previously acquired knowledge and thought patterns) in 
explaining reading behavior. The information processing model of Sticht, Beck, 
Hauke, Kleiman, and James (1974) emphasizes the interaction between the envi- 
ronment and cognitive components such as sensory information storage, short- 
term memory, and long-term memory to generate literacy skills. Massaro f s 
(1975) information processing model of reading delineates four component 
processes or stages between the language stimulus and the meaning response: 
feature detection (sensation), primary recognition (perception), secondary 
recognition (conception), and recoding and rehearsal, information processing 
analyzes reading into component processes; however, the parts are not always 
easily reconstituted into the reading act (Carpenter & Just, in preparation), 
information processing theory is now being used in computer-reading simula- 
tions, in which successful computer programs must have a great deal of "knowl- 
edge* about vocabulary, language structure, and the topic of the text. Car- 
penter and Just (in preparation) described some recent computer simulations of 
reading based on information processing. 

information processing theories of reading have some implications for 
reading assessment. First, these theories imply that environment and short- 
and long-term memory are very important in the reading process and that there- 
fore reading is not a simple thing to measure. Second, one application of 
information processing theory (that of Sticht et al., 1974) has shown that 
reading tests need to be functional (i.e., related to the person's environ- 
ment) for results to be most meaningful. 

interactive-Inferential Theory Provides a New Perspective 

Although the emerging reading theory lacks a consistent, descriptive 
name, we will call it interactive-inferential theory. This theory builds 
largely on the information processing model but also goes beyond it by 
demonstrating the interaction between higher order, interpretive, metacog- 
nitive processes and less complex processes, interactive-inferential theory 
focuses on interactive rather than linear movement in reading and has gener- 
ated a spate of research that centers on reading comprehension instead of 
decoding. We devote more time to interactive-inferential theory than to 
earlier theories, because it offers a more "evolved" view and a number of 
implications for reading assessment. According to this theory, reading (1) is 
an active process in which the reader constructs meaning through Inference and 
interpretation, (2) is purposeful and hence involves motivation, (3) can be 



improved through use of cognitive and learning strategies, (4) is processed by 
the individual through interactions among several levels of information, and 
(5) can be divided into a set of subskills that fit into a unified process but 
that are not as yet fully defined or understood (Parr, Carey, & Tone, in prep- 
aration; J. orasanu, personal communication, January 12, 1984). Each of these 
characteristics of reading is discussed belov, along with its indications for 
reading assessment. 

Action, interpretation and inference . The reader actively constructs 
meaning from text cues by calling on knowledge of language, text structure, 
writing conventions, and the topic itself. The reader applies inference and 
interpretation to go beyond what is given (Parr, Carey, & Tone, in prepara- 
tion). Researchers have studied text cues through methods of text analysis 
and text linguistics. The active, inferential, interpretive nature of reading 
creates at least seven implications for reading assessment. 

Pirst, reading comprehension tests may be most valid when they are func- 
tional, that is, when they involve materials similar to those needed to per- 
form everyday and vocational tasks (Miller, 1973). "Reading assessment ought 
to reflect the schema domains, syntax, vocabulary, style, and structure of 
materials that will need to be read by the individuals taking the tests" 
(Parr, Carey, & Tone, in preparation, p. 24). As a consequence, job-relevant 
reading tests of a criterion-referenced nature (i.e., keyed to a specified 
criterion or standard of performance) may be more useful for Army populations 
than are other kinds of reading tests, at least for the purpose of assessing 
comprehension. However, if a test is given with the intent of ranking group 
members and not with the intent of carefully assessing conprehension, then it 
might be acceptable to use a norm- referenced, general reading test that is not 
job related. 

Second, in order to employ a somewhat uniform cognitive schema, reading 
tests need to be developed, tested, and normed, using populations similar to 
those who will take the test. Por example, an adult reading test should not 
be developed using just children, and items developed for children may not be 
relevant for adults. These facts, while seemingly straightforward, are over- 
looked with surprising frequency. 

Third, the active, inferential, interpretive characteristic of the new 
theory implies that tests should be appropriate to the examinees insofar as 
possible. Therefore, adaptive (tailored) testing may be useful. Adaptive 
testing allows the individual to take only items that are at a relevant dif- 
ficulty level and may shorten test administration time (McBride, 1979). The 
joint services are now designing and testing for the ASVAB computerized adap- 
tive testing, which could be used for adult reading tests as well. 

Pourth, because readers of various cultural backgrounds can interpret 
reading passages differently and come to different conclusions, test bias 
(known as "differential validity") can occur unless background knowledge is 
controlled or explained. However, because reading comprehension depends 
greatly on background knowledge, such knowledge cannot be artificially fac- 
tored out or eliminated by using esoteric content fFarr, Carey, & Tone, in 
preparation) . 
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Fifth, reading tests of literal recall are not as revealing or lifelike 
as reading tests that demand some sort of inferencing. Old-fashioned tests of 
literal recall are therefore to be avoided. 

Sixth, the theory's encouragement of relevance of the test to the exami- 
nee indicates that child- based scores such as reading grade levels (grade 
equivalents) are not useful for adults, although those scores are ubiquitous 
in the armed forces (Farr, Carey, 6 Tone, in preparation; Miller, 1973). 

Seventh, text linguistics as used in the new theory can provide guidance 
on how to create more reliable reading test items, that is, items that accu- 
rately and consistently measure the skill in question. Sample guidelines 
include avoiding items that require stylistic and other ambiguous judgment; 
not testing for incidental, insignificant information; not using harder vo- 
cabulary in questions than in text; and avoiding list-like density of ideas in 
test items (J. Orasanu, personal communication, March 9, 1984). 

In addition to the active, inferential, interpretive aspect of the new 
theory, other aspects — purposefulness, cognitive strategies, nonlinearity, and 
subskills — also have implications for reading assessment. 

Purposefulness . The fact that reading is purposeful and involves moti- 
vation implies that reading test developers should select and present test 
passages that engage the reader in a valid purpose, not just the purpose of 
passing the test (Farr, Carey, & Tone, in preparation). The interest level 
and degree of relevance must be kept high to maintain the reader's sense of 
purpose. A negative sense of purpose (e.g., *If I don't pass this reading 
test I may not be allowed to reenlist*) may have a detrimental effect on the 
performance of some reader? but not of others. Functional reading tests might 
instill a greater sense of purpose than more general reading tests do. 

Cognitive strategies . An implication of the existence of cognitive stra- 
tegies is that such strategies can improve not only reading in general but 
also performance on reading tests. Different strategies useful for a variety 
of purposes and genres can be taught. Such strategies can develop readers 9 
sensitivity and can thus improve comprehension of a passage in a reading test 
or any other text (Brown & Armbruster, in preparation). Cognitive strategies 
are also important in earlier theories, such as that of Gibson and Levin 
(1975). 

Nonlinearity . Readers use interactions among several information levels, 
moving • top-down* and • bottom-up' and mixing higher order inference with simp- 
ler perceptual processes. Because of the nonlinear, complex interaction that 
occurs, reading should not be assessed as the automatic decoding of a sequen- 
tial string of letters. 

Subskills . The last few decades have seen a proliferation of subskills 
in reading tests (Farr, Carey, & Tone, in preparation), despite lack of ade- 
quate subskill definition, lack of a coherent reading theory from which sub- 
skills derive, and lack of & consensus on how many items are needed to measure 
a particular subskill. The new theory may yet lead to a better understanding 
of these subskills and how they can best be measured. 
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Summary of Theories 



We have discussed three types of reading theories: perceptual-cognitive, 
information processing, and interactive-inferential. These theories all have 
important implications for reading assessment. Some measure of agreement 
exists across theories on the importance of examinee context, background 
knowledge, and cognitive strategies for reading. The last theory seems to be 
the most helpful in offering concrete implications for reading measurement. 
We will now discuss psychometric qualities of available reading tests in light 
of these implications. 



PSYCHOMETRIC CHARACTERISTICS OF AVAILABLE TESTS DIFFER 

in this section, the standards implied by reading theory will be applied 
to the review of reading tests currently available for Army use. This review 
will cover both commercially and militarily developed reading tests. Theory 
and practice tell us that good reading tests should have certain psychometric 
characteristics, including appropriate validity, reliability, norms, and 
standards. All reading theories imply the need for valid measures, that is, 
tests that measure what they purport to measure. The emerging interactive- 
inferential reading theory particularly emphasizes the need for a type of 
content validity (or at least content relevance) that reflects the schema the 
reader ordinarily employs. The need for reliability of measurement is implied 
in the text-linguistic basis of this new reading theory. Appropriate norms 
(for norm-referenced tests) and appropriate standards (for criterion- 
referenced tests) are also implicit in the emerging theory. 

Armed Services Have Used Commercial Reading Tests 

Standardized, norm-referenced, commercial reading tests have dominated 
the armed forces market in the past, widely used tests have included the 
Adult Basic Literacy Examination (ABLE), the Test of Adult Basic Education 
(TABE), the Nelson-Denny Reading Test, and the Metropolitan Achievement Test 
(MAT), Reading, Form D. All of these tests have been reviewed elsewhere 
(Buros, 1972, 1975; Oxford-Carpenter & Schultz, 1983). We will provide a 
brief assessment of these tests in light of the standards implied by reading 
theory. 



The ABLE is one of a few commercially available, psychometrically sound 
reading tests originally designed for and normed on adults. From a purely 
technical standpoint, the ABLE appears to be a good commercially developed 
reading test for Army use. The ABLE has well-documented reliability (high 
.80s and .90s). Concurrent validity is based on administration of the ABLE 
and the Stanford Achievement Test to elementary and junior high school stu- 
dents (.60-. 76) and to an adult job corps group (.36-. 72). The test was de- 
signed for adults with varying achievement levels and for adults who have not 
completed formal eighth-grade education. Norms are based on 6,000 elementary 
and junior high school students, 800 job corps members, and 450 adult basic 
education students. Vocabulary, reading, spelling, and arithmetic subtests 
are available in two parallel forms. The reading subtest focuses on compre- 
hension. The ABLE was originally selected for use in the Army's Basic Skills 



Education Program (BSEP). However, the test proved unpopular with some BSEP 
personnel for reasons related to administration not technical quality (Raines, 
1983). The ABLE was replaced by the TABE for BSEP use about five years ago* 

Though possessing some technical merit, the TABE was chosen more on prac- 
tical than technical grounds* The TABE covers reading, language, and arith- 
metic; its reading subtest contains both vocabulary and comprehension* The 
test is actually a revision of the California Achievement Test (CAT), which 
was designed for children* TABE developers removed patently childish refer- 
ences from the CAT or changed them to adult references. TASE subtests have 
adequate reliability (in the high .70s through .90s). Validity documentation 
for the TABE is not compelling, because its concurrent validity is based on a 
correlation of .56 with the General Educational Development examination using 
a small sample. The idea of "inherited* validity from the CAT is spurious. 
Furthermore, the TABE lacks adult norms, and its use in testing adults in the 
Army has drawn strong criticism (Bachem, 1982). Clearly, key career decisions 
for soldiers should not be made on the basis of children's norms, such as 
those of the TABE. However, the TABE has been useful in BSEP for diagnosing 
soldiers 1 strengths and weaknesses. 

The Army has frequently used the Nelson-Denny Reading Test and the MAT. 
Designed to assess reading for college placement and adult reading classes, 
the Nelson-Denny has adult norms. The MAT was given a new, Army cover and 
used as the U.S. Armed Forces Institute (USAFI) Reading Test with few changes 
in the civilian-to-military transformation. Its norm group includes no adults. 
For both tests reliability is strong, but validity is not. The Army has also 
occasionally used other commercial reading tests, such as the Gates-McGinitie 
Reading Test and the Iowa Test of Basic Skills. A review of these commer- 
cially developed reading tests is presented in the Appendix. 

These tests need to be evaluated on the basis of whether they provide 
appropriate validity, reliability, and norms demanded by reading theory and by 
practicality. The Army particularly needs to examine the validity of commer- 
cial reading tests in light of Army needs. Although a given reading test may 
be highly valid for a group of school children, it may not be valid for adults 
in an intense, job-oriented Army setting. Furthermore, the issue of standards 
is pertinent to commercial reading tests as used in the Army. Most commercial 
reading tests are purely norm referenced and do not advertise acceptable stan- 
dards of performance. However, the Army has occasionally used these tests in 
a quasi-criterion-referenced way by designating a given grade equivalent test 
score as an acceptable minimum standard for soldiers. For example, ninth-grade 
reading level has become a magic number to some Army leaders. Before any 
standards are chosen, those standards need to be shown to correlate highly 
with Army job performance (not just with Army job materials). Insufficient 
research has been done in this fruitful area. 

Armed Forces Have Developed Their Own Reading Tests 

While the armed forces have commonly used commercial, norm-referenced 
reading tests, the military has also created reading tests. The Army has taken 
the lead in much of this test development. Throughout the 1970s and the early 
1980s, the Army worked on a series of job-related reading tests for Army en- 
listed personnel (Claudy & Caylor, 1982; Sticht, 1975, 1982; Sticht, Hooke, & 
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Caylor, 1982; sticht, caylor, & James, 1978 ; and Sticht, Cay lor, Kern, & Pox, 
1971 )• These efforts produced the Job Reading Task Tests (JRTT) and the Job 
Reading Test (JRT), which are MOS-specific and emphasize functional literacy. 
Unlike the JRTT, the JRT is normed and machine-scorable. As mentioned 
earlier, the Army is currently involved in a massive test development under- 
taking under the Job Skills Education Program (JSEP), which will provide 
computerized, job-related testing in basic skills. The other armed services 
have also developed their own reading measures, but the efforts appear to have 
been independent rather than coordinated. A leading developer of Army reading 
tests stated that the Army's own reading tests have never been accepted be- 
cause of lack of internal advocacy. Support existed for test development but 
not for test use (T. sticht, personal communication, August 16, 1983). A re- 
view of selected militarily developed reading tests is found in the Appendix. 

An important merit of some reading tests developed by the military is 
also one deficiency: job-specificity. The emerging reading theory encourages 
the use of functional, job-related reading tests. However, because approxi- 
mately 350 jobs exist in the Army alone, the task of developing just one test 
form per job is a monumental endeavor. Of course, to enhance reliability 
there should be several test forms per job. Furthermore, rapid technological 
change in job content means that job-related tests need to be continually 
updated. Such an endeavor is obviously very costly and difficult, other 
options might be to develop reading tests for career management fields instead 
of jobs or to develop reading tests for only the highest density jobs. Even 
these options are not simple. 

Given the issues associated with use of commercial and military reading 
tests, the Army has occasionally sought a surrogate measure of reading, a test 
that can be said to measure reading while not actually measuring it. Many Army 
administrators have looked longingly toward the ASVAB as such a surrogate 
reading test. 



ASVAB SCORES ARE CORRELATED WITH READING TEST SCORES 

Because most reading tests at present cannot adequately account for the 
role of such factors as background knowledge and reasoning ability, such tests 
may be seen as measures of intelligence and experience rs well as measures of 
reading ability (Farr, Carey, & Tone, in preparation). Consequently, it is 
not surprising that reading tests are often highly correlated with general 
measures of aptitude or "trainability* (Jensen, 1981), such as the ASVAB. 
Like similar tests, the ASVAB is "reading-dependent, - that is, reading is 
necessary but not sufficient to perform well on the test. Moderate to large 
correlations have been demonstrated between the ASVAB and several reading 
tests. For example, Sticht (1975) showed a .65 correlation between an 
unspecified reading test and the Armed Forces Qualifying Test (AFQT), which 
covers four key ASVAB subtests. The Job Reading Task Tests developed by 
Sticht for the Army also have moderate (in the .50s and .60s) correlations 
with the AFQT (Sticht, 1975). Fischl (1981) found that the USAFI Reading Test 
correlated .80-. 95 with various composites of the* ASVAB and with the total 
ASVAB for a sample of 600 soldiers. In a recent investigation involving 2,385 
Army and Marine recruits, a .85 correlation was found between the ABLE and the 
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General Technical (GT) composite of the ASVAB (P. Grafton, personal communi- 
cation, August 15, 1983). Among several groups of soldiers whose English 
proficiency was limited and who were headed for Army Bnglish-as-a-second- 
language (BSD training, Oxford-Carpenter (1982) found very low correlations 
(in the teens and .20s) between the ABLE and a number of ASVAB composites. 
After BSL instruction, the correlation between the same soldiers 9 ABLE and 
ASVAB scores rose to the .40s and .50s. Zn summary, reading tests have been 
shown to correlate moderately to highly with the ASVAB as long as language 
problems (for individuals with limited proficiency in English) do aot 
intervene. 

A large verbal element clearly exists in the ASVAB (Valentine, n.d.), and 
some researchers have conjectured that the ASVAB indirectly measures reading 
ability. In fact, the Army has devised a so-called literacy index for the 
ASVAB by equating the ASVAB-GT with a composite of three ABLE subtests, read- 
ing, vocabulary, and arithmetic reasoning (P. Grafton, personal communication, 
August 15, 1983). The index involves ABLE grade equivalent scores, which, 
while apparently simple, have some complexities that are discussed next. 



GRADE EQUIVALENT SCORES CAN BE MISLEADING 

Grade equivalent scores, popularly known in the military as reading grade 
levels when applied to reading tests, are the most used and abused of all scor 
types. The emerging interactive-inferential reading theory implies that read- 
ing tests should be as appropriate as possible to the examinees. Therefore, 
use of grade equivalent scores with adults is potentially misleading (Miller, 
1973). Bachem denounced the use of grade equivalent scores in the military 
setting: "The use of elementary school grade levels to categorize adult com- 
bat soldiers seems little short of an insult, no matter how desperate their 
need for remedial work may be" (Bachem, 1982, p.4). Grade equivalent scores 
are not applicable to high school or adult levels because these scores have 
ambiguous meaning beyond the earlier years of constant growth (Ysseldyke 6 
Marston, 1982), and because they compare adults not with their peers but with 
children. Even using grade equivalent scores with children can be Misleading 
(Oxford-Carpenter fc Schultz, 1983). Scores other than grade equivalents are 
abundant, and many are potentially useful for reading assessment in the Army. 
Some useful norm-referenced scores include percentile ranks, standard scores, 
stanines, and some types of adjusted gain scores, criterion-referenced scores 
which are frequently encountered in Army training programs, include number of 
objectives mastered or passed, number of trials to mastery, and time to mas- 
tery. These scores may also be useful for Army reading measurement. The 
emerging theory of reading suggests that criterion-referenced scores may be 
very helpful, because criterion-referenced tests are geared toward specific 
objectives and have relevant standards and may therefore be more germane than 
other types of tests. However, the theory does not rule out use of norm- 
referenced tests and their associated scores. 
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SIX MAJOR CONCLUSIONS ARE REACHED 

Six major conclusions may be drawn from both theory and practice. First, 
reading is important in military and civilian work life. Second, reading as- 
sessment is a highly visible and important issue in the Army. Third, reading 
theories— especially the interactive- inf erential theory—can positively influ- 
ence reading measurement practices in the Army. Fourth, reading tests differ 
widely in terms of psychometric characteristics and overall quality as evalu- 
ated using theory-based standards. Fifth, high correlations exist between the 
ASVAB and various reading tests, although caution needs to be exercised in 
using any part of the ASVAB as a reading surrogate. Sixth, alternatives to 
grade equivalent scores are available and should be considered for use by the 
Army. 
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Table 1 



Review of Commercially Developed Reading Tests 



TOTSOUES 



KVaOfEK 

on 



LATEST 



MAMMTEMOQ) 



NfTENOEOTEfT 



Aduft Basic Uarrv 
ing Earn (ABIE) 1 

AND 


Corporation 




To determne general 
aducaifon level of adufts; 
to evaJua* aduft educa- 
tion programs. 


Adurtswrth achieve- 
ment lev* grades 

KM* 
adults who have not 
completed formal 8th 
grade education. 


Grade «*irvalents, 
based on equating 
ABLE to Stanford 
Achievement Test 
(SAO using scores of 
•vusncs ai graoes 
2-7; authors encour- 
age us* of local 
norms (percentiles 
andstanines). 


WUDfTT 

ConcurrertvaidHylowtorix)deratB 
based on: (t)aoVnWstrationof ABcE 
and SAT to elementary and junior 
Ngh students (j60-.7$); ft admtts- 
tration of ABLE and SAT paragraph 
meaning to Job corps group (26-.7Q. 


SetectABLE 


PsychofogicaJ 
Corporation 


1974 


To screen for determma- 
bon of which ABLE level 
toadmWstet 


Same as for ABIE. 


Raw score, percent 
correct 


MA 1 


Carver-Darby 
Chunked Reading 
Test 


RevTac Pubficabon 
(developed by 
Arnencan Institutes 
tor Research) 


1972 


"fo measure Wbrmattoo 
stored dicing reading. 


Grades 9-16 and 
aduto. 


Efficiency, accuracy, 
end rate scores; no 
standardtoed scores. 


Vafidity best viewed as pflot study 


Gates-McGinroe 
Reading Test 


Hwjghton Mfftwi 


1978 


% assess three areas of 
readng. 


Grades 1-12. 


Raw score, extended 
standard score, 
percentile* 


Contant va&Sty suitable; may have 
high verbaUG component (has high 
correlation with Urge Thomdfce 
verbal W). 


Gray Oral Reading 
Test 


BoObs-MerrB 




To assess oral reading 


Grades 1-lfianf 
adutt. 


vxioe equivalents 
(total score onry). 


INA' 


Metropolitan 
Achievement Test 2 


Psychological 
Corporabon 


1978 


To assess achievement 
inanumoerof sJdB 
areas. 


GradM 25*95 for 
reading subtest. 


Scaled score, per- 
centiles, stenlnes, 
graoe eo^rvajents. 


Cor^vifidrrygood.nopreo^ 
or other vafidrty rnentioned In 
reviews; test authors suggest local 
curriculum vaJtfty be ctocfcM. 


Nelson-Denny 
Reading Test 


Houghton- Mrffiin 


1973 


% assess naolng for 
colege placement and 
adutt reaolng classes. 


Grades 9-16 and 
adults. 


Percentiles, grade 
eounalents. 


Uttie data on cencurrent and predic- 
tfve vaWrty; nothing on content or 
construct vaiidrty. 


RBHBasic Read- 
ing and Vtord Test 


Ricrta.*dson, 
Bellows. Henry & 
Company 


1969 


% assess reaolng com- 

prahensJonand 

vocaetfary 


Disadvantaged adutts 


Percentiles, standard 
scores. 


Lacks convincing validity data; no 
statistics. 


RBHTestof 

Reading 

UHnprcnenSJon 


Richardson, 
Bellows. Henry & 
Company 




"fo tost reedfrg com- 
pnjhension using six 
articles related to busi- 
ness and Industry. 


Business and 
Industry. 


Percentiles, standard 
scores. 


Licks convincing vafidity data; low 
vaaofty coefficients (.17-.45). 


Tests of MM 
Basic Education 
(IABE) 


CTB/McGraw-HiS 


1976 


measure reaolng and 
artihmetic levels of 
adiAs using an adapted 
version of a children's 
test 


Adufts reading at 
levels of children in 
grades 2-4 
(level E), 4-6 
(level M), 7-9 
(Level D). 


Grade equivalents 
based on Cafitorrtfa 
Achievement Tests 
(CAT). 197U 


Content veidrfy based onry on Item 
Section procedures from CAT. Ques- 
tionable va*drty Coocurrem validity 
.56 with GEO test 


Wide Range 
Achievement lest 

LevetlP 

t-lntofmat)onnotav 


Jastak Assessment 
Systems 

aHabto 


1978 


assess oustty three 
dbcrete areas of 
achievement 


12 years to adutt. 


Standard scores, 
graded eo^jrvalents. 
percertbies based on 
age. 


Content validity questionable. 
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Table 1 (Continued) 
Review of Commercially Developed Reading Tasts 



MOUP 

vt summon rrai 

muAntmr worm momouAi mm scoring rmms kokakas quality 



to exelem lor school 
group (£7-55). job corps 
group (-85-56). and 
sto basic educations 4 ^ 

/Wife / O1..0J1 tin U4v. 

mation on tast-ftttst or 
otfttf reiabCty 


Based on WOO 
elementiry and 
Junior high stu- 
dents, 800 job 
corps members, 

VKJ *OU •OUU MSC 

education students. 


Group 


LeveM: USmn 
fevel2: U5min 
Level 3: 207 rnm. 1 


Hand or Machine 


A&B 


vocabulary, readmg. 
speJSng. aranmebc. 


reams appear technfcaBy 
weJ constructed. Most 
items relevant to adults. 
Readmg Items are dcae 
types and do not mea- 
sure el facets of 
reading-Just readmg 
comprehension. 


KR-20 ^4-87. spfit-haif 
-86-51. 


452mStary 
recrurts. 


Group 


15rrtn. 


Hand 


One Form 


Total 


Adequate 


RehabKty best viewed as 

pBotstucV-nothigh 

(.43-31). 


Ibtaffy unsatisfac- 
tory; means and 
standard deviations 
cviudw on 4i 

toBege student 
volunteers". 


Group 


25 mm. optional 


Machine 


A&B 


Efficiency, accuracy, 
rata. 


Orvides sentences into 
chunks but without a 
consistent theoretical 
oasts reBUM to c4 
readmg act. 


Alternate form refiabSty 


Norming procedures 
good 


Group 


Level A,B: 50 mm. 
uevei u. -AJ mm. 
LevelCS: 7 min. 
Level Cfc 40 mm. 
Level E: <4 mm. 


Hand or Machine 


1. 2.3 


SptNoVaccuracy. 

vocabulary, 

comprehension. 


Adequate 


ouikhiu errors ui mcr 

surement adequate. 


rtviHo D9SQO on 
very sma samples 
(N«40 per grade). 


IUA1 


Mat 


tun 


A D f** ft 

A, o» U, U 


Comprehension, total. 


items sasstactory to 
good; oral readmg on/). 


Internal cr.Tststency refia- 
b*ty 50 + . standard 

cuurs or nicisurcrricni 

exceBent. 


Norm Koup care- 
fufly seiccted. 


Group 


Level P-2: 16O-170 
min.; 

LBVBI t. CO-CSo 

min.; 

Level 1*270 min ; 
level A: 265 min. J 


Hand or Machine 


J&K 


knowledge, 
readmg, ttal word 
ararysis (or language, 
decondmgonlest 
leveO- spettng, mathe- 
matics (computation, 
concepts problem serv- 
ing, total). 


Items written lor 
children not adults. Con- 
tent problems because 
content remote from 
aduftWe. 


High alternate forms 
reWity for most sub* 

samples, however. 


Standirdteabon OK; 
adutt norms based 

Art f*t*t ft rr\A Willi- 

On CW wiK Nirafr 

istratlon only 


Group 


30-35 mm. 


Hart or Machine 


A&B 


vocabulary, compre- 
hension, total, rata. 


Mostly we8 constructed 
items. 


Lacks convincing refiabft- 
rtydata(j86prooabryan 
overestimate). 


INA< 


Group 


25-30 mm. 


Hand 


One Form 


Total 


Items cover mostly 
vocabulary. Choice of 
vocabulary tested b 
questionable. 


RefiabSty data not ade- 
quate; no means and 
standard deviations 
given. 


Test probably was 
too easy for norm 
group; norms not 
adequately 
standardized. 


Group 


25-25 mm. 


Hand 


One Form 


Total 


INA 1 


KR-20 for reading test 
stcbon 36-54; test- 
re test for rearing 


fvo adult norms; 
norms based on 
students in grades 

2-a 


Group 


Level E: 127 min. 
Level M: 209 mm. 
Level 0: 151 mm * 


Kant: 


3&4 


Rearing (vocabulary, 
comprehension, total). 
ArWimetic (reasoning 
hjno^manteJs. total). 
Language (mechanics, 
spaing. totaO-for top 
two levels. Total: (Loca- 
tor tastatoavalatoie). 


Items were written for 
cfdoren but have been 
revised to omit purely 
chidrsh references. 


High reported reSabffibes 
(58) ve suspect. 


No national norming 
sample. Norms 
developed from 
continuous age 
data. Identity and 
nature o* norm 
group not deai 


Part 

individual. 


20-30 mm. 


Hand 


One Form 


Soelmg, arttawflc. 
reaclng (baaed on 
finical factor 
anarysh"). 


Questionable Item devel- 
opment and quality. 
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Table 2 

Review of Selected Military Developed Reading Tests 



DEVELOPER 



TEST SERIES 


OR 

PUIUSHER 


LATEST 
COPYRIGHT 


MAIN INTENDED 
PURPOSED 


INTENDED TEST 
POPULATION 


SCORES 


VALIDITY 


Air Force Reading 
Abdrty Test 
(AFRAF) 1 


US Ail Force 


\m 


To assess reading ability 

MWM fair fn arvr**It*iit 

readers in the Air Force. 


5th grade through 
coOege level; not 
intended for poor 
readers 


GE, percentile. 


Concurrent validity is .72 with Califor- 
nia Achvt. lest, .75 with Neison- 
Oenny. Predictive valttty moderate 
(.40 in predfctinc Officer Training 
School grades, -.13-J61 in predicting 
technical training grades). Compre- 
hension is better predctor than 
vocabulary. 


Job Reading Task 
Tests 4 


U.S. Army 


circa 

1971-1973 


To assess preiorrnance 
on Army job reading 
tasks with items bared 
on 3 military occupa- 
tional specialities 
(MOS); for research pur- 
poses onfy 


Army enlisted 
personnel. 


Percent correct, 
reading grade level. 


Moderate (j64-j80) concurrent vafldrty 
wtth standantoed achvt test Lower 
concurrent validity with AFGT. 
Moderate correlations wtth course 
grides and job knowledge tests. 


Job Reading Task 
Tests 4 


U.S Army 


circa 1975 


To assess performance 
on Army job reading 
tasks with items based 
onBMOS. 


Army enlisted 
personnel. 


GE, percentiles. 


Content vafldrty good, based on items 
tomemptncafly determined job 
rating tasks; concurrent validity 
ranges JB9-.79 with USAF1RGL 
criterion. 


Job Readmg Tests 5 


U.S Army 


circa 1962 


lb assess performance 
on Army job reading 
tasks with items based 
onBMOS. 


Army enlisted 
personnel. 


fcrcentile. 


validity data not convincing . Item 
validity appears adequate. Onfy infor- 
mation on test valkfity is that the con- 
tent is drawn carefuOy from 6 MOS. 


U& Armed Forces 
Institute (USAFI) 
Reading Test 














(same as 
Metropolitan) 















1 - Information from Mathews (personnel communication, September 7, 1983) and valentine (nd.) 

2 - Information not available from published source 

3 - Or.: :„-,r\ assumed unless more than one is specified 

4 - Information from Stocrrt (1975) 

5 - Information from Ctaudy & Cayfor (1982) 
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Table 2 (Continued) 
Review of Selected Military Developed Reading Tests 



vT SUITESTSOR ITEH 
INDIVIDUAL TIMING SCORING FORKS' SCORE AREAS OUAUT 



Internal consistency 
reliability appro. 50 for 
whole test. 


Adult male and 
female Air Force 
and Army enlistees. 


Group 


50min total test. 


Hand or Machine 


A, B 


Comprehension, 
vocabulary, total. 


\tocabulary and com- 
prehension items above 
average in quality 


Test-retest reliability 
75-£0l No other reliabil- 
ity inforrroton. 


INA* 


Group 


INA* 


INA> 


Three forms; cook, 
supply clerk, vehi- 
cle repairman. 


Tables, standards and 
specifications, Identifi- 
cation and description, 
procedural (fractions, 
checkpoints, tunc- 
tional description. 


INA* 


KR-21: alternate 
forms: .6S-.76; SE of 
rnsmt: £2-36. 


750 young adult 
male Army recruits. 


Group 


1hr + 


Hand 


A,ftC 


Using index to locate 
information, extracting 
information from tables 
and narrative prose, 
folowlng procedural 
directions. 


Free-response, filMn- 
Mank items cause stow 
scoring. 


INA* 


Normtng appears 
adequate. 


Group 


30-40 min. 


Machine 


A.ac 


Locating job informa- 
tion ki tables, lodex. 
graphs, and narrative; 
forms completion. 


Item statistics appear 
adequate. Multiple 
choice items. 
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